■ . 1.. . • 

SPEECH CODER AND SPEECH DECODER 

CROSS-REFERENCE TO RELATED APPLICATION 

This application is a continuation of U.S. Application No. 10/133,735, filed April 29, 2002, 
which is a continuation of U.S. Application No. 09/319,933, filed June 18, 1999, issued as U.S. 
Patent No. 6,415,254 on July 2, 2002, which is the National Stage of International Application No. 
PCT/JP98/04777, filed October 22, 1 998, the content of which is expressly incorporated by reference 
herein in its entirety. The International Application was not published under PCT 21 (2) in English. 

Technical Field 

. 5 . The present-invention relates to a speech coder for. " 

efficiently coding speech information and' a sDeech 
decoder for efficiently decoding the 'same. 
Background Art- • . _ . ' 

A speech coding technique for efficiently coding- 

10 and. decoding . speech information has been develooed in 
recent years. • In Code Excited Linear ' Predict ion : "High 
Quality Speech at 'Low Bit Rate", M. R , Schroeder , Proc . 
ICASSP ' 85 ,.pp: 937-940, ther e is de s.crib ed a. speech coder 
of a CE.LP. type, which .is on the basis of such a speech - 

15 coding technique. 

In this speech coder, a linear prediction for an 
input . spee.ch. is carried out in every frame, which is 
divided at a, fixed time. . A prediction residual 
( excitation signal ) is "obtained by the linear predict ion • ' 

20 for each frame . Then , the prediction res.idual is codejl 
using an adaptive codebook in which a previous excitation 
. signal- is stored and. a random codebook in which a 
plurality of random code vectors^ is stored. 

FIG.'l shows a functional block of a conventional 

25 CELP type speech coder. 

A speech signal 11 input to the -CELP ' type speech 
coder is subjected, to a 'linear predict ion analys is in 
a linear prediction analyzing section 12. - A linear 
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predictive coefficients can be obtained by the linear 
prediction analysis. The linear . predictive 

coefficients are parameters indicating ah spectrum 
envelop of the speech s ignal 1 1 . The linear predictive 
■5 coefficients obtained in the linear prediction analyzing 
section 12 are quantized by a linear predictive 
coefficient coding- section 13, and the quantized linear 
• predictive coefficients are sent to a linear predictive 
coeff icient • decoding section 14. Note that an index 
10 obtained by this quantization is output to a code 
outputting section 24 as a linear predictive code. The 
linear, predictive coefficient decoding section 14. 
decodes the linear predictive coefficients quantized by 
the linear predictive coefficient coding section 13. so 
15 as to obtain coefficients of a synthetic filter. The 
■ linear predictive coefficient decoding section 14 
outputs these coefficients to a synthetic filter 15. 

An adaptive codebodk- 17 is one, which outputs a 
plurality o'f candidates of adaptive codevectors , and 
.20 which comprises a buffer for storing excitation signals 
corresponding to previous several frames,. The adaptive 
codevectors are time series vectors, which express 
periodic components in the input speech. 

A random codebook 18 is one, which stores a 
25 plurality of candidates of random, codevectors. The 
random code vectors are time series vectors, which 
express .non-periodic components in the input speech, 
In an adaptive code gain weighting section 19 and 
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a random code gain weighting section 20, the candidate 
vectors output from the adaptive codebook 17 and the 
random codebook 18 - are multiplied by - an adaptive, code 
gain read from a weight codebook 21 and a random code 
5 gain, respectively, and the resultants are output to an 
adding section 22. 

The weighting codebook stores a plurality of 
adaptive cpdebopk gains by which the adaptive codevector 
is multiplied and a" plurality of random codebook gains 

10 by which the random codevec.tors are multiplied. 

The adding section 22 adds the adaptive code vector- 
candidates and the random code vector candidates, which 
are weighted in the adaptive code gain weighting section 
19 and the random code gain weighting section 20, 

15 respectively- Then, the adding section 22 generates 
excitation vectors so as to be output to the synthetic 
filter 15 . 

The synthetic filter 1:5 is an all-pole filter. The . 
coefficients of the synthetic filter are obtained by the 

20 linear predictive coefficient decoding section 14. The 
synthetic filter 15 has a function of synthesizing input 
excitation vector in order .to produce synthetic speech 
and outputtihg that synthetic speech {o a distortion 
calculator 16. • 

25 A distortion calculator 16. calculates a distortion 

between the synthetic speech, which is the output of the 
synthetic filter 15, and the input speech 11, and outputs., 
the obtained distortion value to a code index specifying 



section 23. The code index specifying section 23 
specifies three kinds of . codebook " indicies (index of 
adaptive codebook, index of. random codebook, index of 
weight codebook) so as to minimize the distortion 
5 calculated by. the distortion calculation section 16. 
The three, kinds of codebook indicies specified by the 
code index specifying section "23 are output to a code 
outputting section 24. The code outputting sect:ion 24- 
outputs the index of linear predictive codebook obtained 
10 by the linear predictive coefficient coding section 13 
and the index of adaptive codebook, the index of random 
code/ the index of weight codebook, which have been 
specified by the code index specifying section 23, to 
.a transmi-ssion path at one time. 
15 FIG. 2 shows a functional block. of a CELP speech 

decoder, which decodes the speech signal coded by the 
aforementioned coder . In this speech decoder apparatus , 
a code input section. 31 receives codes sent from the 
speech coder (FIG: 1), The received codes are 
20 disassembled into the index of the linear predictive 
codebook, the index of adaptive codebook, the index of 
random codebook , and the index of weight, codebook . Then , 
the indicies obtained by the above disassemble are output, 
to a linear predictive coefficient decoding section 32, 
25 an adaptive codebook 33, a random codebook 34, and a 
weight codebook 35, respectively. ' 

Next, the linear predictive coefficient decoding 
section 32 decodes the linear predictive code^ number 



obtained by the eode input sect ion • 3 1 so as to obtain 
coefficients of the synthetic filter, and outputs those 
coefficients to a synthetic filter 3 9". Then, an adaptive 
codevector Gorre spending to the index . of adaptive 
5 . codebook is read from adaptive codebook, and a random 
codevector corresponding to the index of random codebook 
is read from the random codebook. .-Moreover, an adaptive 
codebook gain and a random codebook .gain corres'ponding 
to the index of weight codebook are read from the weight 
10 " codebook.. Then, in an adaptive codevector weighting 
section 36, the adaptive codevector is multiplied by the 

■ adaptive codebook gain, and -the resultant is sent to an 
adding section 38..' Similarly in- a random codevector. 
weighting section 37, the random codevector is 

15 multiplied by the random codebook gain, and the, resultant 

is sent to' the adding section 38 . 

.. The adding section 38 adds the^bove two codevectors 

and generates an exci tation-vector . Then, the generated 

excitation vector is sent to the adaptive codebook 33 
20 to update the buffer or the synthetic filter 39 to excite 

the filter. The synthetic filter 39, composed with the 

■ linear, predictive, coeffcients which are output from 
linear predictive coefficient decoding section 32, is 
excited by the excitation vector obtained by the adding 

25 section 38, and reproduces a synthetic speech. 

Note that, in the distortion calculator 16 of the 
CELP speech coder, distortion E is generally calculated 
by the following expression (1): 
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E ^\\v -{gaHP^gcHQ f {I) 

where v: an input speech signal (vector), 

H: an impulse response convolution matrix 
for a syn the tic filter 

' A(0) . 0 ••• ■■ 0 0 " • 
-/z(l) h(0) 0 ••• 0 0 
/i(2) h{V) /z(0) 0 0 0 

; - . . ; o o 

/i(L-l) ••• h{l) h(0) 

wherein, h is ah impulse response of a synthetic 
filter, L is-a frame length, 

p: an adaptive codevector, 
c: a random codevector, 
10 . ga: an adaptive codebook gain 

gc: a random codebook gain 

Here, in order to minimize distortion E of • 
expression (1) , the distortion is calculated by a closed 
loop with respective to all combinations of the adaptive 

15 code number, the random code number, the weight code, 
number, it is necessary to specify each code number. 

However, if the closed loop search is performed with 
respect to expression (1), an amount of calculation 
processing becomes too large. For this . reason, 

20 generally, first of all, the index of adaptive codebook 
is specified by vector quantization us ing the adaptive 
codebook. Next, the index of., random coodbook is 
specified by vector quantization using the random 
codebook. Finally, the index of weight codebook is 



5 // = 



.... 7 ■ 

specified by vector quantization using the weight 
codebook. Here, the . following will specifically 
explain the vector quantization processing using the 
random codebpok. 
5 In a case. where the index of adaptive' codebopk or 

the adaptive codebook gain are previous ly o r temporarily 
det ermi.ned , the expression for evaluating distortion 
shown in' expression (1) is changed to the foTlowing 
expression . ( 2 ) . , 

10 . ■ . Ec = II x-gcH.C f ■ " (2) 

where vector x in expression (2) is random 
excitation target vector for specifying a random code 
number which is obtained by the following equation (3). 
using the previously or temporarily specified adaptive 

15 codeyector and adaptivie codebook gain. 
X = v-gaHP ( 3 ) 

where ga: an adaptive codebook gain, 

V: a speech signal (vector)., ' 

. ? - 

H: an impulse response convolution matrix f.or a 
20 synthetic filter, 

p:. an adaptive codevector. 

For specifying the random codebook gain gc after 
specifying; the index of random codebook, it can be 
assumed that gc in the expression (2) can be set to an 
25 arbitrary value. For this reason, it is known that a 
quantization" processing for specifying the index of the 
random codebook minimizing the expression (2) can be 
• ireplaced with the determination of the index of the 
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random codebook vector maximizing the following 
fractional expression (4): 

VCfW . .. (4) 

In other words., in a case where the index of 

5 . adaptive .Godebook and the adaptive codebook gain are 

previously or temporarily determined, vector 

quantization pxocessing for random excitation "becomes 

processing for specifying . the index of the random 

codebook - maximizing . fractional expression (4) 

10 calculated . by the distortion calculator 16. 

In the CELP coder/decoder in the early stages, one 

-that stores kinds of random sequences corresponding to 

the number of bits allocated in the memory was used as 

a random codebook . However / there was a problem in which 

15 a massive amount of memory capacity. was required and the 

amount of calculation processing for calculating 

distortion of expression (4) with respect to each random 

■ - - p. 
codevector was greatly increased. 

As one of methods for solving. the above problem, 

20 there is a CELP speech code.r /decoder using an algebraic 

excitation vector generator for generating an excitation 

vector algebraically as described in "8KBIT/S ACELP 

■ CODING OF SPEECH WITH 10 MS SPEECH - FRAME : A CANDIDATE 

FOR CCITT STANDARDIZATION" : R. Salami, C. Laflamme, J-P. 

25 .Adoul, ICASSP'94, pp. 11-97-11-100., 1994. 

However, in the above CELP speech coder /decoder 

using an algebraic excitation vector generator, random 



excitation (target vector for specifying an index of 
random cpdebook) obtained by equation (3) is 
approximately expressed by a few signed'pulses . For this 
reason . there is a limitation in improvement of speech 
5 quality.. This is obvious from an actual investigation - 
- of an element for random excitation x of expression (3) 
wherein there are few cases in which random excitations 
are composed only of a few signed . pulses . . 
Disciosure of Invention 

10 Ari object of the preisent invention is to provide 

an excitation vector * generator , which is capable of 
generating an excitation vector whose shape has a 
statistically high similarity to the shape of a random 
excitation obtained by analyzing an input speech signal. 

15 Also, an object of the present invention is. to 

provide a CELP speech coder /decoder , a speech signal 
communication system, a speech signal recording system, 
which use the above excitation vector generator as a 
random codebook so as to obtain a synthetic speech having 

20 a higher quality than that of the case in which . an 
aigeb'r'aic excitation vector generator is used, as a random 
codebook. 

A first aspect of the present invention is to provide 
an excitation vector generator comprising a pulse vector 
25 generating section having N channels (N^l) for generating pulse 
vectors each having a signed unit pulse provided to one 
element on a vector axis , a storing and selecting section having 
a function of storing M (MSl)kinds of dispersion patterns every 



channel and a function of selecting a certain kind of dispersion 
pattern from M kinds of dispersion patterns stored, a pulse vecTor 
dispersion section having a function of convolving the dispersion 
pattern selected from the dispersion pattern storing and selecting 
5 section. to the signed pulse vector output from the pulse vector 
generator 30 as to~ generator N dispersed vectors, and a dispersed 
' vector adding section having a function of adding N dispersed 
vectors generated, by the pulse vector . dispersion secfxpn so as 
to generate an. excitation vector. The function, "for 

10 algebraically generating (N21) pulse vectors is provided to 
the pulse vector generator, and the dispersion pattern storing 
and selecting section stores the dispersion patterns obtained by 
pre-training the shape (chaxacteris tic) of the actual vector, 
whereby making it possible to generate the excitation vector , which 

15 is well similar to the shape of the actual excitation, vector as 
compared with the conventional algebraic excitation generator. 

• Moreover, the second aspect of the present invention 
is to provide a CELP speech coder /decoder using the above 
excitation vector generator as the random codebook, which 

20 is capable of generating the excitation vector being 
closer to the actua.1 shape than the case of the 
conventional speech coder / decoder using the algebraic 
excitation generator as the random codebook. Therefore, 
there . can be obtained the speech coder/decoder, speech 

25 signal communication system, and speech signal recording 
system, which can output the synthetic speech having a 
higher quality. 
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Brief Description of Drawings . 
. . FIG. 1 is a functional". block diagram of 

conventional CELP speech coder: 

FIG.- 2 is functional block diagram- of 

5. conventional CELP speech decoder; ■ 

FIG. 3 is a functional block diagram of an 
excitation vector generator according' to a first 
; ■ embodiment of -the. present invention; " ■ 

FIG . 4 is a functional block diagram of a CELP speech 

10 coder according to a second embodiment of the present . 

invention; •• , 

FIG. 5 is a functional block diagram of a CELP speech ■ 

decoder according to the second embodiment of the present 

invention ; . 

15 FIG. 6 is a functional block diagram of a CELP speech 

coder according to- a' third embodimeat of the present 
invention ; 

FIG. 7 is a functional block diagram of a CELP speech 
■ coder according to a fourth embodiment of the present 

20 invention ; 

■FIG. 8 is a functional block diagram of a CELP speech 

coder according t o • a fifth embodiment, of- the present 
invent ion ; 

FIG. 9 is a functional block diagram of a vector 
25 quantization function according to the fifth embodiment 
of the present invention; 

FIG. 10 is a view explaining an algorithm for a 
target extraction according to the fifth embodiment of 
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the present invention; . 

FIG. 11 is a functional block diagram of a 
predictive quantization according to the -fifth 
embodiment of the present invention; 
5 . FIG. 12 is. a functional block diagram of a 

predictive quantization according to a sixth embodiment 
-of the present invention; 

FIG. 13 rs a functional block diagram of" a CELP 
speech coder according to a seven th embodiment of the 
10 present invention; and 

FIG. 14' is a functional block diagram of a 
distortion calculator according to the seventh 
embodiment of the present invention.. 

15 Best Mode for Carrying Out the Invention 

Embodiments will now be described with reference 
to the accompanying drawings . ' 
(First embodiment) 
. FIG. 3 * is a functional block diagram . of an 
20 excitation vector generator according to a first 
embodiment of the present invention. 

The excitation vector generator comprises a pulse 
vector generator 101 having a plurality of channels, a 
dispersion pattern storing and selecting section 102 
25 having dispersion pattern storing sections and switches, 
a pulse vector dispersion section 103 for dispersing the 
pulse vectors,,' and a dispersed vector adding section 104 
for adding the dispersed pulse vectors for the plurality. 



of channels . 

The pulse vector generator 101 comprises N (a case 
of 'N.= 3 will be explained in t his embodimen t ) channels 
for generating vectors (hereinafter referred to aspulse 
5.. vectors) each having a signed unit pulse with provided 
to one element on a ^vector axis. 

. The dispersion pattern storing and selecting 
section 102 comprises storing sectio.ns Ml tb"~M3 for 
storing M (a case of M = 2 will be. explained in this 
10. ■ embodiment) kinds of dispersion patterns, for each 
channel and switches SWl to SW2 for selecting one kind, 
of dispersion pattern f rom M kinds of dispersion patterns 
stored in the respective storing sections Ml to M3 . 

The piilse vector dispersion section 103 performs 
15' convolution of the pulse vectors output from the pulse 
. vector generator 101 and the dispersion pa t terns, output 
from the dispersion pattern storing and selecting 
section 102.in every channel^so as to generate N dispersed 
vectors . 

20 The dispersed vector adding section 104 adds up N. 

dispersed vectors generated- by the purse vector 
• dispersion section 103 , thereby generating an excitation 
vector 105 , . 

. Note that, in this embodiment, a case in which the 
25 pulse vector generator 101 algebraically generates N 

(N = 3) pulse vectors in accordance with the rule described 
in Table 1 set forth below will be explained. 
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TABLE 1 



Channel Number 


Polarity 


Pulse Position Candidates 


CHI 


±1. 


/''(O, -10, 20, 30, •-. 60, 70)' 


CH2 


*1 


P'- 


'2, 12, 22, 32, 62, 72' 
6, 16, 26, 36, • ■, 66, 76 




. CH3 


■ ±1 : 


>3 


■4, 14, 24, 34, ■ 64, 74 
8, 18, 28, 38, ■••, 68, 78 





.. An . Operation of the abo ve - s t rue tur ed excitation 
. vector generator will t)e explained. 
5 . The dispersion pattern storing and selecting, 

section 102 selects a dispersion pattern by one kind by 
one from dispersion patterns stored two kinds by two for 
each channel, and outputs the dispersion . pattern . In 
this case, the number is allocated to each dispersion 
10 pattern in accordance with the combinations of selected 
dispersion patterns (total number of combinations: 

Next, the pulse vector generator 101 algebraically 

.? ■ ■ - 

generates the 'signed pulse vectors corresponding to the 

15 number of channels (three in this embodiment) in 
accordance with the r.uJLe described in Table 1. 

The pulse vector dispersion section 103 generates 
a dispersed vector for each channel by convolving the 
dispersion patterns selected by the dispersion pattern 

20 storing and selecting section 102 with the signed pulses 
generated, by the pulse vector generator 101 based on the 
following expression (5): 
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ci(n) = ^wj{n-k)di{k) - (5) 



w h e r e n : 0 - L - 1 , 
L: dispersion vector length, 
i: channel number, 
5 j: dispersion pattern number (j=l-M), 

ci: dispersed vector for channel i, 
■ wi j : dispersed pattern for channel i j wherein the 
vector length of wi j ( m ) is 2L - 1 ( m : - ( L - 1 ) - L - 1 ) ^ and it 
is the element, Li j , that can specify the value and the 
10 other elements are = zero , 

di: signed pulse vector for channel i, 
• di = ± 5 (n-pi) , n = 0~L-1 , and 

pi: pulse position candidate for channel i. . 
The dispersed vector adding section 104 adds up 
15 ttaree: dispersed vectors generated by the pulse vector 
dispersion section 103 by the following equation (6) so 
as to generate. the , excitation vector 105. 



cin) ^ ^ci(n) (6) 



where c: excitation vector , 
20 ci: dispersed vector, 

i : channel number ( i= l-N ) , and 

n: vector element number (n = O-L-l: note that L 
is an excitation vector length) . 

The above-structured excitation vector generator 
25 can generate various excitation vectors by adding, 
variations to the combinations of the . dispersion 
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■ patterns, which the dispersion pattern storing and 
selecting section 102 selects, and the pulse position 
and polarity in the pulse vector, which the pulse vector 
generator 101 generates. 
■5 Then, in the above^structured excitation vector 

: generator, it is ppssible to allocate bits to two kinds 
of inf ormation . having the combinations of dispersion 
patterns selected by the dispersion pattern storing and 
selecting section 102 and the combinations of the shapes 
10. (the pulse positions and polarities.) generated by the 
pulse vector generator 101. The indices of this 
excitation vector generator are in a one-to-one 
correspondence with two kinds of information. Also, a 
training processing is . executed' based on actual 
15 .excitation information in advance and the dispersion 
patterns obtainable as the training result can be stored 
; in th.e. dispersion pattern storing and selecting section 
. 102 . ' • . 

Moreover, the above excitation vector generator is 
20 used as the excitation information generator of speech 
coder/decoder to transmit two kinds of indices including 
the combination index of dispersion patterns selected 
by .the dispersion pattern storing and selecting section 
102 and the combination index of the configuration (the 
'25 pulse positions and polarities) generated by the pulse 
vector generator 101, thereby .making it possible to 
transmit information on random excitation. 

Alsio, the use of the above-structured excitation 



17 

vector generator allows the configuration 
(characteristic) similar to actual excitation 
information to be generated as compared with the use of 
algebraic codebook. 
5. The above embodiment explained the case in. which 

the dispersion pattern storing and selecting section. 102 
stored two kinds of dispersion patterns per one channel.. 
However, the similar function and effect can be obtained 
in a case in which the dispersion patterns other than 

10 . two. kinds are allocated to each channel. . . ■ 

Also, the above embodiment explained the^ case in 
which the pulse vector generator 101 was based on the 
three -channel structure and the pulse generation rule 
described in Table 1. However, the similar function a.nd 

15 effect can be obtained in a case in which the number of 
channels is different and a case in which the pulse 
generation . rule other than Table 1 is used as a pulse 
generation rule. . 

A speech signal communication system or a speech 

20 signal recording system having the above excitation 
vector generator or the speech, coder / decoder is 
structured, tjiereby obtaining the functions and esffects 
which the above excitation vector generator has. 
(Second embodiment) • . 

25 FIG. 4 shows a functional block of a CELP speech 

coder according to. the second embodiment, and FIG. 5 
shows a functional block of a CELP speech decoder. 

The CELP speech coder according to t hi s . embodiment 
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appli^ the excitation vector generator explained in the 
first embodiment to the random codebook of the CELP 
speech coder of FIG. 1. Also, the CELP speech decoder., 
according t.o^ this embodiment applies the excitation 
5 vector generator explained in the first embodiment to 
the random codebook of the CELP speech decoder of FIG. 
2. Therefore, processing other. than vector 

quantization p'rocessing for random excitation^ is the 
same as that of the apparatuses of FIGS. 1 and 2. This 
.10 embodiment will* explain the speech coder and the speech 
decoder with particular emphasis, on vector quantization 
processing for random excitation. Also, similar to the 
first embodiment^ the generation of pulse vectors are 
based on Table 1 wherein the number of channels N = 3 
15 and the number .of dispersion, patterns for one channel - 
: M ■= 2 , ■ ; . 

The vector .quantization processing for random 
excitation in ^the speech coder illustrated in FIG. 4 is 
one that specifies two kinds of indices (combination 
20 index for dispersion patterns and. combination index for 
pulse positions and pulse polarities) so as to maximize 
reference values in expression (4). 

In a case where the excitation vector generator 
illustrated in FIG. 3 is used as a random codebook, 
.25 combination index for dispersion pat terns .{ eight kinds) 
and combination index for pulse vectors (case 
considering the polarity: 16384 kinds) are searched by 
a closed loop. 



For this reason./ a dispersion pattern storing and 
selecting section 215 selects either, of two kinds of 
diapers ion pat terns store;d in the dispersion pattern 
storing and selecting section itself, and outputs the 
5 selec t ed disper s ion pattern to a pulse vector dispersion 
section 217. Thereafter, a pulse vector generator 216 
algebraically generates pulse vector s corresponding to 
. the number of "channels (three in this embodim'ent ) in 
accordance with the rule described in Table .1, and 
10 outputs the generated pulse vectors to the pulse vector 
dispersion section 217. 

The pulse vector dispersion section 217 generates 
a dispersed , vector for each channel by .a convolution 
calculation. The convolution calculation is performed 
15 on the basis of the expression (5) using the dispersion 
patterns selected, by the dispersion pattern storing and 
selecting section 215 and the signed pulses generated 
by the pulse vector genera^tor 216. 

A dispersion vector adding section 218 adds up the 
20 dispersed vectors obtained by the pulse vector 
dispersion sect ion 217/ thereby generating exc-itatipn 
vectors (candidates for random codevector s ) . 

Then, a distortion calculator 206 calculates 
evaluation values according to the express ion ( 4 ) using 
25 the random code vector candidate obtained by the 
dispersed vector adding section 218. The calculation on 
the basis .of the expression (4) is carried put with 
respect to all combinations of the pulse vectors 
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generated based on the rule of Table 1. Then, among the 
calculated values, the combination index for dispersion 
patterns and. the combination index for pulse vectors 
(combination of the pulse posit ions and the polarities) , 
5 which are obtained when the evaluation' value by the 
expression (4) becomes maximum and the maximum value are 
output to a code . nuniber specifying section 213. 

Next; the-dispersion pattern storing and serlecting 
section 215 selects the combination for dispersion 

10 patterns which is different from the previously selected 
combination for the dispersion patterns. Regarding the 
combination for dispersion patterns newly selected, the 
calculation of the value of expression (4) is carried 
•out with respect to all cpmbinatipns of the pulse vectors 

-15 generated by the pulse vector generator 216 based on the 
rule of Table 1. Then, among the calculated values , the 
combination index for dispersion patterns and the. 
combination index for pulse\vectors which are obtained 
when the value of expression (4) becomes maximum and the 

20 maximum value are output to the code indices specifying 

section 213 again. . ; 

The above processing Is repeated with respect to 
all combinations (total number of combinations is eight 
in this embodiment) seleptable from the dispersion 

25 patterns stored in the dispersion pattern storing- and 
selecting section 215. 

The code indices specif ying section 213 compares 
eight maximum values in total calculated by the 
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distort ion calculator 206, and selects the highest value 
of. all. Then, the code indices . specif ying section 213 
specifies two kinds of combination indices (combination 
index for dispersion patterns, combination index for 
5 pulse vectors ) , which are obtained when the highest value 
is geneiratetd, and outputs the specified combination 
indices to a code outputting section 214 as an index of 
, random codebooTc. 

On the other hand, in the speech decoder of FIG. 

10 5, a code, inputting, section 301 receives : codes 
transmitted from the speech coder (FlQ. 4), decomposes 
the received codes into the corresponding index of LPC 
codebook, the index of adaptive cpdebook, the index of 
random codebook (composed of two kinds of the combination 

15' index f or dispers ion pat terns and combination index for 
pulse vectors ) and the index of weight codebook. Then, 
the code inputting, section 301. outputs the decpmppsed 
indicies to a linear prediction coefficient decoder 302 , 
an adaptive codebook, a random codebook 304 , and a weight 

20 codebook 305. No te that, in the random code number, that 
the combination index for dispersion patterns is output 
to a dispersion pattern storing and storing section 311 
and the combination index for pulse vectors is output 
to a pulse vector generator 312. 

25 Then, the linear prediction coefficient decoder 

302 decodes the linear predictive code number, obtains 
the coefficients for a synthetic filter 309, and outputs 
the obtained coefficients to the synthetic, filter 309. 
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In the adaptive codebook 303, an adaptive codevector 
corresponding to the inde?c of adaptive codebook: is read 
from , . 

In the random codebook 304, the dis per s ion pat tern 
5 storing and selecting section 311. reads the . dispersion 
patterns corresponding to the combination index for 
- dispersion pulses in every channel, and outputs the 
result an tto a pulse vector dispersion section 3r3. The 
pulse vector generator 312 generates the pulse vectors 
10 corresponding to the combination index for pulse vectors . 
and corresponding to the number of channels , and outputs- 
the resultant to the pulse vector dispersion section 313, 
The pulse vector dispersion section 313 generates . a 
dispersed vector for each channel by convolving the 
15 dispersion patterns received from the dispersion pattern 
storing and selecting section 311 on the singed pulses, 
received from the pulse vector generator 312. Then, the 

generated dispersed vectors are output to a dispersion 

■ ■ ■ * *...■■* 

vector adding section 314 . ' The dispersion vector adding 
20 section 314 adds up the dispersed vectors of the 
respective channels generated by the. pulse vector 
dispersion section 313, thereby generating a random 
codevector . . . 

Then, an adaptive codebook gain and a random 
25 codebook gain corresponding to" the index of weight 
codebook are read from the weight codebook 305. Then, 
in an adaptive code vector weighting section 306, the 
adaptive codevector is multiplied by the adaptive 
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codebook gain* Similarly in a ranciom code vector 
weighting section- 307, the random codevectpr is 
. multiplied by the random codebook gain . Then, these 
resultants are output to an adding ' s ec t ion - 3 0 8 .. 
5 The adding section 308 adds up the above two code 

vectors mult iplied • by the. gains so as to generate an 
excitation vector. Then, the adding section 308 outputs 
. the generated excitation vector to the adaptive codebook 
303 toupdate a^ buffer or to the synthetic filter 309 
10 to excite the synthetic filter. 

The synthetic filter 309 is excited by the 
excitation vector obtained by the adding section 308, 
and reproduces a synthetic, speech 310\ Also, the 
adaptive codebook 303 updates the buffer by the 
15 excitation vector .received from the adding section 308. 

' In this, case , suppose that the dispiersion patterns 
obtained by pre-training are stored for each channel in 
the dispersion pattern storing and selecting section of 
FIGS. 4 and 5 such that a value of cos t f unct ion becomes 
20 smaller wherein the cost function is a distortion 
evaluation expression 7 ) in which the excitation vector 
described in expression (6) is substituted into c of 
expression ( 2 ) . ^- 




25 where x: target vector for specifying index of 
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random codebook, 

gc: random codebook gain, 
H: impulse response convolution matrix for synthetic 

filter-, 

c: random codevector, 
i: channel number (ii - 1-N), 
j: dispersion pattern number (j - 1-M) 
ci: dispersion vector for channel i, 
wij: dispersion patterns for channels i-th, j-th 

kinds , 

di : pulse vector for channel i, and 
L: excitation vector length (n ,= 0~L-1). 

The above embodiment explained the case in which 
the dispersion patterns obtained by pre-training were 
„ stored M by M. for each channel in the dispersion pattern- 
storing and selecting section such that the value of cost 
function expression (7) becomes smaller. 
However, in actual, all M dispersion patterns- do not have 
to be obtained by training. If at least one kind .of 
dispersion pattern obtained by training is stored, it 
is possible to obtain the functions and effects to 
improve the quality of the synthesized speech. 

Also, the above embodiment explained that case in 
which from all combinations of dispersion patterns 
stored in the dispersion pattern storing and selecting 
section stores and all . combinations of pulse vector 
position candidates generated by the pulse vector 
generator, the combination index that maximized the 



reference value of expression (4)- was specified by the 
closed loop.. However, the similar functions and effects 
can be obtained by carrying out a pre-selection. based • 
on other parameters (ideal gain for adaptive codevector, 
5 ■ etc. ) obtained before specifying the index of the random 
■cpdebobJc or by a open loop search. 

Moreover, a speech signal communication system or 
a speech signa-1 recording system having the above the 
speech coder / decoder is structured, thereby obtaining 
10 the functions and effects which the excitation vector 
generator described in the first embodiment has- 
(Third embodiment) 

FIG; 6 is a functional block of a CELP speech coder 
according to the third embodiment. According to this 

15 embodiment, in the CELP speech coder using the excitation 
vector generator of the first embodiment in the random 
codeboolc, a pre- select ion for dispersion patterns stored 
in the dispersion pattern storing and selecting section 
is carried out using the value of an ideal adaptive 

20 codebook gain obtained before searching the. index . of 
random codebook.. The other portions of ~ the random 
codebook peripherals are the same as those of the CELP 
speech coder of FIG. 4. Therefore, this embodiment will 
explain the vector quantization processing for random 

25 excitation in the CELP speech coder of FIG. 6. 

This CELP speech coder comprises an adaptive 
codebook 407, an adaptive codebook gain weighting 
section 409, a random codebook 408 constituted by the 



excitation vector generator explained in the first 
embodiment , a random codebook gain weighting section 410 , 
a synthetic filter 405, a. dis t ort ion. calcu la t or 4 0 6 , an 
indices specifying section 413/ a dispersion pattern 

. 5 storing and selecting section 415, a pulse vector 
generator 416, a pulse vector dispersion section 417 , 
a dispersed vector addirig section 4 18 , .and a. distortion 
power juding section 419. ^ ' 

In this case according to the above embodiment , 

.,10 suppose that at least one ofM (M = $2) kinds of dispersion 
patterns stored in the dispersion pattern storing and 
selecting section 415 is the dispersion pattern that is 
obtained from the result by performing a pre-training 
to reduce quantization distortion generated in vector 
15 quantization processing for random excitation 

In this embodiment, for simplifying the 
explanation, it is assumed that the number N of channels 
of the pulse vector generator is 3 , and the number M of 
kinds of dispersion patterns for each channel stored in 

'20 the disper s ion pattern storing and selecting section is 
2 . Also , suppose that'one of M (M =? 2 ) kinds of dispersion 
patterns is dispersion pattern obtained by the 
above-mentioned training, and other is random vector 
sequence (hereinafter referred to as random pattern) 
2.5 which is .generated by a random vector generator. 
Additionally, it is known that the dispersion pattern 
obtained by the above training has a relatively short 
length and a pulse-like shape as in wll of- FIG- 3. 
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In the CELP speech coder of FIG. 6, processing for. 
specifying the index of the adaptive codebook before 
vector quantization of random excitation is carried out.. 
Therefore, at the time when vector quantization 
5 processing of random excitation is carried out,, it is 
possible to refer to the index of the adaptive cod.ebdo}c 
and the ideal adaptive codebooJc gain (temporarily 
decided). In this embodiment, the pre - selection for 
dispersion pat terns is carried out using the value of 
10 the ideal adaptive codebook gain. 

More specifically, first, the- ideal value of the 
adaptive codebook gain stored in the code indices 
specifying section 413 just after the search for the 
index of adaptive codebook is output to th^ distortion 
15 calculator 40 6 . The distortion calculator 406 outputs 
the adaptive codebook gain received/from the code indices 
specifying section 413. to the adaptive code book gain 
judging section 419. . , 

The adaptive gain judging section 419 performs a 
20 comparison between the value of the ideal adaptive 
"codebook gain received from the distortion. calculator 409 
and, a preset threshold value. Next, the adaptive 
codebook gain judging section 419 sends a control signal 
for a pre-selection to the dispersion pattern storing and 
25 selecting section 415 based - on the result of the 
comparison. The contents of . the control signal will be 
explained as follows . 

More specifically, when the adaptive codebook gain 
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Ls larger than the threshold value as a result of the. . 
comparison> the control signal provides an instruction 
to select the dispersion pat tern obtained- by the pre- 
training to reduce the quant izat ion' distortion in vector 

.5 quantization processing for random excitations . Also, 
when the adaptive code gain is not larger than the 
threshold value as a result of the comparison , the control : 
signal provides an ins truction to carry out the pre^ 
selection for the dispersion pattjern .different from the 

10 dispersion pattern obtained from the- result of the 
. pre- training . 

As a consequence, in the dispersion pat tern, storing 
and selecting selection 415, the dispersion pattern of. 
M (M = 2) kinds, which the respective channels store, can 

15 be pre-selected in accordance with the value of the ideal 
adaptive codebook gain , so that the number of combinations 
of dispersion patterns can be largely reduced.- This 
eliminates the need of the distortion calculation for all 
the combinations of the dispersion patterns, and makes 

20 it possible . to efficiently perform the vector 
quantization processing for random excitation with a 
small amount of calculations. 

Moreover, the random codevec tor is pulse-like 
shaped when the value of the adaptive gain is large (this 

25 segment is determined as voiced) and is randomly, shaped 
when the value of the adaptive gain is small (this segment 
is determined as unvoiced). Therefore, since the random . 
code vector having a suitable shape for each of the voice 
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segment the speech sig.nal and the non-voice segment, can 
be used, the quality of the synthtic speech can be 
improved... 

Due to the simplification of the explanation , this 
5 embodiment explained limitedly the case, in which the 
number N of channels of. the pulse vector generator .was 
3 arid the number M of kinds of the dispersion patterns 
was 2 per. channel stored in the dis^persipn pattern storing 
and selecting section. However',, similar effects and 

10 functions, can be obtained in a case in which the number 
' of channels of the pulse vector generator and the number 
of kinds of . the dispersion patterns per channel stored 
in the dispersion pattern storing and selecting section 
are different from the aforementioned case. 

15 Also, due to the simplification of the explanation, 

the above embodiment explained the case in which one of 
M kinds (M = 2). of dispersion patterns stored in each 
channel was- dispersion pat-terns .obtained by the above 
• training and the other was random patterns. However,, if 

20 -at least one kind of - dispersion pat tern obtained. by the 
training is stored for each channel/ the similar effects 
and functions" can be expected instead of the above- 
explained case. 

Moreover, this embodiment explained the case in 

25 " which large and small information of the adaptive code book 
gain was used in means for perf orming pre - select ion of 
the dispersion patterns. However, if other parameters, 
showing a short-time character of the input speech are 
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used in addition to large and small information of the ; 
adapt ive codebook gain, the similar effects and functions 
can be further expected. 

Further, a -speech signal communication system or a 
5 speech signal recording system having the above the speech 
coder/decoder is structured, thereby obtaiining the 
functions and effects which, the excitation vector 
generator . descxibed in the first embodiment has. 

In the explanation of the above embodiment, there 

10 was ■ explained the method in which the pre-selection of 
. the. dispersion pattern was carried out using the ideal 
adaptive cpdebook gain of the current frame. at the time 
when vector quantization processing of random excitation 
was performed. However,, the similar structure can bei 

15 employed even in^ a case in which a decoded adaptive 
codebook gain obtained in the previous frame is used, 
instead of the ideal, adaptive, codebook' gain in the 
current frame. In this cas^e, the similar, effect s. can be . 
also obtained^ 

20 (Fourth embodiment) 

FIG, 7 is a functional block diagram of a CELP spee'ch 
coder according to the fourth embodiment. In this 
embodiment, in the CELP speech coder using the excitation 
vector generator of the first embodiment in the random 

25 codebook, a pre-selection for a plurality of dispersion 
patterns stored in the dispersion pattern storing and 
selecting section is carried out using available 
information at the time of vector quantization 



processing for random excitations . It is characterized 
that a value of a coding distortion (expressed by an S/N 
ratio), -that is generated in specifying the index of the 
adapt ive codeboolc , is. used as a= refer^ence of the 
5 pre -select ion . • . . . ' 

Note that the other portions of the random codebook 
peripherals are. th'e same as thpse of the. CEL.? speech coder 
of FIG. 4. Therefore, this embodiment will specif ically 
explain the vector quantization processing for random 
.10 excit.at ion 

As shown in FIG. 7, this CELP speech coder comprises' 
.an adaptive codebook. 507-, an adaptive codebook gain 
weighting section 509 a random codebook 508 constituted 
by the excitation vector generator explained in the first 
15 embodiment , a random codebook gain weighting' section SlO , 
a synthetic filter 505, a distortion calculator. 506, a 
code indices specifying section 513, a dispersion 
pattern storing and selecting section 515 , a pulse vector 

■3 ■ 

generator 516, a pulse vector dispersion section 517, 
20 a dispersed vector adding section 518, and a coding 
distortion judging section 51^. 

In this case, according to the above embodiment, 
suppose that at- least one of M (M = $2) kinds of dispersion 
patterns stored in the dispersion pattern storing and 
25 selecting section '515 is the. random pattern. 

In the above embodiment, for simplifying the 
explanation, the number N of channels of the pulse vector 
• generator is 3 and the number M of kinds of the dispersion 
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patterns is 2 per channel stored in the dispersion 
pattern storing and selecting section . Moreovei:, one. of 
M (M = 2) kin<is of dispersion patterns is the random 
pattern, and the oth~er is the dispersion pattern that 
5 is obtained as the " result of pre -training to reduce 
quantization distortion generated- in vector 
quantization processing for random excitations. 
: / In the CELP speech coder of FIG. 7 proces sing f or 

specifying the index of the. adaptive codebook is 

10 performed before vector quantization processing for 
random excitation. Therefore, at the; time when vector 
quantization processing of random excitation is carried 
out, it is possible to refer to the index of the adaptive 
codebook^ the ideal adaptive codebook gain (temporarily 

15 decided), and the target vector for searching the 
adaptive codebook. In this embodiment,- the pre- 
selection for dispersion patterns is carried out using 
the coding distortion (expressed by S/N ratio) of the 
adaptive codebook which can be calculated from the above 

20 three information. 

More" specif ically , the index of adaptive codebook 
• and the value of the adaptive codebook gain (ideal gain), 
stored in the code indices specifying section 513 just 
after the search for the adaptive codebook is output to 

25 the distortion calculator 506. The distortion, 
calculator 506 calculates the coding distortion (S/N 
ratio) generated by specifying the index of the adaptive 
codebook using the index of adaptive codebook received 
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from the code indices- specifying section 513, the 
adaptive codebgok: gain, and the target vector for 
searching the adaptive codebogk. .Then, the distortion 
eaicuiator 506 outputs- the calculated S/N value to the 
5 coding distortion judi.ng section 519. ■ " 

The coding distortion juding section 519 performs, 
a comparison between the S/N value received from the 
distortion calculator- 506 and a preset threshold value. 
Next, the coding distortion juding section 519 sends a 

10 control signal for a pre - select ion to the dispersion- 
pattern storing and selecting section 515 based on the 
result of the comparison . The . contents of the control 
signal will be explained as follows.. 

More specifically, when the S/N value is larger than 

15 the" threshold value as a result of the comparison, the 
control signal provides an instruction to select the 
dispersion pattern obtained by the pre-training to reduce 
the quantization distortion generated by coding the 
target vector for searching the random cpdebook. Also, 

20 . when the S/N value is smaller than thie threshold value 
as a result of the comparison, the control signal provides 
an instruction to select the non - pulse - like random 
. patterns . 

As a consequence, in the dispersion pattern storing 
25 and selecting selection 515, only one kind is pre -selected 
from M (M = 2) kinds of dispersion patterns, which the 
respective channels store, so that the number of 
combinations of dispersion patterns can be largely 
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reduced.. This eliminates the need of the distortion 
calculation for ail the combinations of the dispersion 
patterns, and make^.-it possible to efficiently specify 
the index of the random codebook with a small amount of 
5, calculations. 

Moreover, the random codevector is pulse-.like 
shaped when the S/N value is large, and is non-pulse- 
like/shaped when the S/N value is small. Therefoire, since 
the shape of the random codevector can, be changed in 
10 accordance with the short-time characteristic of the 
speech signal, the quality of the synthetic speech can 
be improved. 

Due to the simplification of the. explanation , this 
embodiment explained limitedly the case . in which the. 

15* number N of channels of the pulse vector generator was 
3 and the number M of kinds of the dispersion patterns 
was 2 per channel stored in the dispersion pattern storing 
and selecting section. However, similar effects and 
functions can be obtained in a case in which the number 

20 of channels of the pulse vector generator and the number 
of kinds of the dispersion patterns per channel stored, 
in the dispersion pattern storing: and selecting section 
are different from the aforementioned case. 

Also,, due to the simplification of the explanation, 

25 the above embodiment explained the case in which one. of 
M kinds (M = 2) of dispersion patterns stored in each 
channel was dispersion patterns obtained by the above 
pre - training, and the other was random patterns . However , 



if at least one kind of random dispersion. pattern is store,d 
for each channel, the similar effects and functions can 
be expected instead of the above - explained case-. 

Moreover, this embodiment explained the' case in 

r ■ 

5 which only large and ' small information of coding 
distortion' (expressed by S/N value) . generated by 
specifying the index of the adaptive codebook was used 
... in means for pre-seleeting the dispersion, pattern. 
However, if o ther information , which correctly shows the 
10 . short-time characteristic of the speech signal, is 
employed in addition thereto, the similar effects and 
functions can be further expected'. 

Further, a speech signal communication system or a 
speech signal recording system having the above the speech 
15 coder /decoder is structured, thereby obtaining the 
* functions and effects which the excitation vector 
generator described in the first embodiment has. 
(Fifth embodiment) 

FIG, 8 shows a functional block of a CELP speech 
20 coder according to the fifth embodiment of the present 
invention. According to this CELP speech coder, in an 
LPC analyzing section 600 performs a self - correlation 
analysis and an LPC analysis of input, speech data 601, 
thereby obtaining LPC coefficients. Also, the obtained 
25 LPC coefficients are' quantized so as to obtain the index 
of LDC codebook, and the obtained index is decoded so as 
to obtain decoded LPC . coefficient s . 

Next, an excitation generator 602 takes out 
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excitation samples stored in an adaptive codebook 603 and 
. a random codebook 604 (anadaptive code vector (or adaptive 
excitation) and random codevec tor . . ( or a' random 
excitation) ) and sends them to an LPC synthesizing section 
5 605 . . " ' ' . ■ • ■ ' 

The LPC .synthesizing section 605 filters two 
excitations obtained by. the excitation generator 602 by 
the decoded LPC coefficient obtained, by the LPC analyzing 
section 600, thereby obtaining, two synthesized 

10 excitations... 

' In a comparator 606, the relationship between two 
synthesized excitations obtained by the LPC synthesizing 
section 605 and the input speech 601 is analyzed so as 
to obtain- an optimum value (optimum gain) of two 

15 synthesized excitations. Then, the respective 

synthesized excitations , which are power controlled by 
the optimum value , are added so as to obtain an integrated 
synthesized speech, and a distance calculat ion ' between 
the integrated synthesized speech and the input speech 

20 is carried out. 

The distance; calculation between each of many_ 
integrated synthesized speeches, which are obtained by 
exciting the excitation generator 602 and the LPC 
synthesizing section 605, and the input speech 601 is 

25 carried out with respect to all. excitation samples of the 
adaptive codebook 603 and the random codebook 604 . Then , 
an index of the excitation sample, which is obtained when 
the value is the smallest in the distances obtainable from 



: . 37 , 

the result, is^ determined. 

Also, the obtained optimum gain*, the index of the 
excitation sample, and two excitations, responding to the 
index are sent to a -parameter coding se-ction 607. In the 
5 parameter coding section 607 , the- optimum gain is coded 
sp as to obtain a gain code, and the index of. LPC codebook 
and the index of the excitation sample • are s ent to a 
transmission path 608 at one time. 

Moreover, an actual excitation signal is generated 
10 from two excitations responding to the gain code and the 
index, and the generated excitation signal is stored in. 
the adaptive codebook 603 and the old excitation sample 
is abandoned at the same. time. 

Note that, in the LPC synthesizing section 605, a 
15 perceptual weighting filter using the linear predictive 
coefficients, a high-frequency enhancement filter, a 
long- term .predictive filter, (obtained by carrying out 
a long- term prediction analysis of input speech) are 
generally employed - Also, the excitation search for the 
20 adaptive codebook and the random codebook is generally 
carried out in segment s ( referred. to as subframes) into 
which an analysis segment is further divided. 

The following will explain the vector quant izat ion . 
for LPC coefficients in the LPC analyzing section 600 
25 according to this embodiment. 

FIG. 9 shows a functional block for realizing a 
vector quantization algorithm to be executed in the LPC 
analyzing section 600, The vector quantization block 
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shown in FIG. 9 comprises a target extracting section 702 / 
a quantizing section 703, a distortion calculator 704, 
a comparator 705 , a decoding vector storing section 707 , 
and a vector smoothing section 708. 
5 In the target extracting section 702 , a quant ization . 

target is calculated based on an input vector 701. Here, 
a ta.rget extracting method will be . specifically 
- explained . 

In this embodiment, the * input vector'* comprises two 
iO kinds of vectors in all wherein one is a parameter vector 

obtained by analyzing a current frame and the other is 
■ a parameter vector obtained from a future frame in a like 

manner. The target extracting section 7Q2 calculates a. 

quantization target using the above input vector and a 
15- decoded vector of the previous frame stored in the decoded 

vector storing •section 707. An example of the 

calculation method will be shown by the following 

expression {8), 

- {S,Xi).^pid(^)-^SUi)/^^^ (8) . 

20 , where . X(i) : target vector, 

i: vector element number, 
S.(i), S^^i(i): input vector, 
t: time (frame number). , 

p: weighting coefficient (fixed):, and 
25 d(i): decoded - vector of previous frame. 

The following will show a concept of the above target 
extraction method. In a typical vector quantization, 
parameter vector S\.(i) is used as target X(i) and a 



matching is performed by the follo.wing expression (9) 



, En = ^{Xii)-Cn{i)y •■ (9) ■ . 

where En : distance from n-th code vector, 

X(i): target vector, 

Cn(i): code vector, 

n:"code vector number, 

•i: order of vector, and 

I: length of .vector.' 
Therefore, in the conyentional.vector quantization, 

the coding distortion directly leads to degradation • in 
speech quality. This was a big problem in the ultra- 
low bit rate coding in which the coding distortion cannot 
be avoided to some extent even if measurements such as 
prediction vector quantization is taken. 

For this reason., according to this embodiment, 
attention should be paid to- a middle point of the decoded 
vector as a direction where the user does not perceptually 
feel an error easily, and the decoded, vector is induced 
to the. middle point so as to realize perceptual 
improvement.. In' the above case. there is used a 
characteristic in which time continuity is not easily 
heard as a perceptual degradation. 

The following will explain the above state with 
reference to FIG. 10 showing a vector space. 

First of all, it is assumed that the decoded vector 
of one previous frame.is d(i) and a future parameter vector 
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is 5,^^(1) (although a future coded vector is actually 
desirable, the future parameter vector is. used for the 
future coded vector since the coding cannot be carried 
out in the current frame. In this case, although the code 
5 vector Cn(i): (l)is closer to the parameter vector St ( i ) ■ 
than the code vector Cn(i): [2)\ the code vector Cn(i): 
(2) is . actually close onto a line connecting d{i) and 
Sd(i) . . For this reason degrada'tion is not easily heard 
;as compared with (1)*. Therefore, by use of the above 
10 charact eristic , if the target X(i) is set as a vector 
placed at the position where the target X{i) approaches 
to the middle point between d ( i ) a:nd- S^^i ( i ) f rom St(i) 
to some degree, the decoded vector is induced to a 
• direction where the amount of' distortion is perceptually 
15 slight . ^ ; 

Then according to this embodiment, the movement of. 
the target can be realized by introducing the following 
evaluation expression ( lOJ 

^(0, = {S,(i) + p{d{i)^S,Ji)r2}/{Up)' ••• (10). . 
20 where X(i) : target vector, . . 

i; vector element number, 

St(i). Sc^i{i): input vector , 

t : time ( frame number ) , 

p: weighting coefficient (fixed), and 
25 d(i): decoded vector of previous frame. 

The first half of expression (10) is a general 
evaluation expression , and the second half is a perceptual 
component. In order to carry out the quantization by the 
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above evaluation expression, the evaluation expression 
-is differentiated with respect to each X(i) and the 
differentiated result is set to 0 , so that expression ( 8 ) 
can be pbtained. 
5 . ' Note that the weighting coefficient p is a positive 
.constant. Specifically, when the weighting coefficient 
p- is zero, the result is similar to the general 
quantization when the weighting coefficient p is infinite , 
the target is placed at the completely middle point. If 

10 . the weighting coefficient p is too large, the target is 
largely separated from the parameter 3^(1) of the current 
■ frame so that articulation is percept ually reduced . The 
test listening of decoded speech confirms that a good 
performance with 0 . 5<p<l . 0 can be obtained. 

15 Next, in the quantizing section 703 , the 

quantization target obtained by the target extracting 
section 702 is quantized so as to obtain a vector code 
■and a decoded vector, and t,he obtained • vector index and 
decoded vector are sent to the distortion calculator 704. 

20 Note that a predictive vector quantization is used 

as a quantization method in. this embodiment. The 
following will explain the predictive vector 
quantization. 

FIG. 11 shows a functional block of .the predictive 

25 . vector quantization . The predictive vector quantization 
is an algorithm in which the prediction is carried out 
using the vector (synthesized vector) obtained by coding 
and decoding in the past and the predictive error vector 



is quantized. 

A vector qodebook 800, which stores a plurality of 
main, . samples ■ ( codevec tors ) ' of the prediction error 
vectors., is p.repared in advance. This is prepared by an 
5 ■ LBG algorithm (IEEE TRANSACTIONS ON COMMUNICATIONS , VOL . 
COM- 2 8, NO. 1, PP3 4 - 95, JANUARY 19 80 ) based on a' la"rge 
number o,f vectors obtained by analyzing a large amount 
of speech data. . 

A vector 801 for quantization target is predicted 
10 * by a prediction section 802. The prediction is carried 
out by the post-decoded vectors stored in a state storing 
section .803 , and the obtained predictive error vector is 
sent to a distance calculator 804 , Here, as a form of 
prediction, a first prediction order and a fixed 
15 coefficient are used. Then, an expression for calculating the 
predictive error vector in the case of using the above- 
prediction, is shown by the following expression (11). 
Y(i) = X(i) - j3D{i) . ^ (11) 

where Y(i): predictive eriror vector, .. ^ 

20 X(i): target vectpr, 

3: prediction .coefficient (scalar) 
D(i) : decoded vector of one previous 
frame , and 
' i; vector. order. 
25. In the above expression, it is general that the 

prediction coeff.icient i3 is a value of 0 < 5 < 1 : 

Next, the distance calculator 804 calculates the 
distance between the predictive error vector obtained by 
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the prediction section 802 and the codevector stored in 
codebook 800. An expression for obtaining the above 
'distance is shown by the . following express ion (12): 

■ EnA{T{L) - CniO) ■ (12) . ' ' 

5 . where En: distance from n^-th code vector, 

Y(i): predictive error vector, - 
-Cn ( i ) : codevector , 

n: codervector number , 
I: vector order, and 
10 I: vector length. 

Next, in a searching section 805 , the distances for 
respective codevectors are compared, and the index of 
codevector which gives the shortest distance is output 
as a vector code 806. 
15 In other words, the- vector codebook 800 and the 

distance calculator 804 are co.ntrolled so as to obtain 
the index of codevector which gives the shortest distance 
from all codevectors stored in ,the vector codebook 800, 
and the obtained ind.ex is used as vecto.r code 806. 
20 Moreover, the vector is coded using the code vector 

obtained from the vector codebook 800 and the past-decoded 
vector stored in the state storing section 803 based on 
the final coding, and the content of the state storing 
section 803 is updated using the obtained synthesized 
25 vector. Therefore, the decoded vector here is used in 
the prediction when a next quantization is performed. 

The decoding of the example (first prediction order. 
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fixed cpeffiGient) in. the above-mentioned prediction 
form is performed by the following expr e.s s ion ( 1 3 ) : 
Z(i) = :CN(i) -H i3D(i) ... (13) 

where 2(i).: decoded vector (used as P(i) at a next coding 
5 time/ 

N ; code f or vec tor , 
CN(i):. code vector, 

.3 : prediction coefficient (scalar), 
D(i): decoded vector of phe previous 
10 * frame, and 

i : vector order. 
9n the other hand, in a decoder, the code vector is ' 
obtained based on the code of the transmitted vector so 
as to be decoded . In the decoder, the same vector 
15 codebook and state storing section as those of the coder 
are prepared in advance. Then, the decoding is carried 
out by the same algorithm as the decoding function of the 
searching section in the aforementioned coding algorithm. 
The above is the vector quantization, which is executed 
20 in the quantizing section 703. 

Next., the distortion calculator 704 calculates a 
perceptual weighted coding distortion from the decoded 
vector obtained by the quantizing section 703, the input 
vector '701, and the .decoded vector of the previous frame 
25 . stored in the decoded vector storing section 707. An 
expression for calculation is shown by the following 
expression (14): 



£w = 5;(^-(o-5,(o 7+p{K(o-(rf(o+5,.,(o/2}— ••• (14)- 

where Ew: weighted coding distortion, 
■ St(i). S^,i( i).:. input vector , 
t: time (frame number) 
5 . ■ i: vector element number, 

V(i): decoded vector,. 

p: weighting coefficient (fixed), and 
-d(i): decoded vector of previous frame . 
In expression (14) , the weighting efficient p is the 

10 same as the coefficient of the expression, of the target 
used in the target extracting section 702,- Then, the 
value of the weighted coding distortion, the encoded 
vector and the code of the vector are sent to the 
comparator 70 5. 

15 The comparator 70 5 sends the code of the vector sent 

from the distortion calculator 704 to the transmission 
path 608, and further updates the content of the. decoded 
vector storing section 707 using the vector sent from the 
distortion calculator .704 . 

20 According to the above-mentioned embodiment, in the 

target extracting section 702, the target vector is 
corrected from 3^(1) to the vector placed at the position 
approaching to the middle point between D(i) and S^,^{t) 
to same extent. This makes it possible to perform the 

25 weighted search so as not to arise perceptual degradation . 

The above explained the case in which the present 
invention was applied to the low bit rate speech coding 
technique used in such as a cellular phone . However, the 
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present invention can be empioyed in not only the speech 
coding but also the vector quantization for a parameter 
having a. relatively good in.terpolat ion in a music coder, 
and an image coder. 
5 In general, the LPC coding executed by the LPC 

analyzing section in the above-mentioned algorithm, 
conversion to parameters vector such as LPS ( Line Spec tram 
Pairs), which are easily coded, is .commonly performed, 
and vector quantization (VQ) is carried out by Euclidean 

10 distance or weighted Euclidean distance. 

• Also, according to the above embodiment, the target 
extracting section.TOZ sends the input vector 7.01 to the 
vector smoothing section 708 after being subjected to the 
control of the comparator 705. Then, the target 

15 extracting section 702 receives the input vector changed 
by the vector smoothing section. 708, thereby re- 
extracting the target. 

In this/case, the comp^irator 705 compares the value 
of weighted coding dis tort ionT sen t from the distortion 

20 calculator 704 with a -reference value prepared in ■ the 
comparator. Processing is divided' into two, depending, 
on the comparison result. 

If the comparison result is under the reference 
value, the. comparator 705 sends the index of the, 
.25 codevector sent from the distortion calculator to the 
transmission path 608, and updates the content of the 
decoded vector storing section 707 using the cpdeid vector 
sent from the distortion calculator 704. This update -is 
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carried out by rewriting the content of the decoded vect.or . 
storing section 707 using the obtained coded vector. 
Then, processing moves to one for a next frame parameter 
• coding . . - 

5 While, if the comparison result is more than the • 

.reference value, the comparator 705 controls the vector 
smoo thing section 7 0.8 and adds a change to the input vector 
so that the target extracting section 702, the quantizing 
section 703 and distortion calculator 704 are functioned 

10 again to -perform coding again. 

In the comparator 7 0 5, coding processing is repeated 
until the comparison result reaches the value under 
reference value* However, there is a case in which the 
comparison result can not reache the value, under the 

13 reference value even if coding processing is repeated many . 
times'. In case, the comparator 705 provides a counter 
in its interior, and the counter counts the number of times 
wherein the comparison result is determined as being more 
than the reference value. When the number of times is 

20 more than a fixed number of times , the comparator 705 stops 
the- repetition of coding and clears the -comparison result 
and counter state, then adopts initial index. 

The vector smoothing section 708 is subjected to the 
control of the comparator 705 and changes parameter vector 

25 Sc(i) of the current frame, which is one of input vectors, 
from the input vector obtained by the target extracting 
section 702 and the decoded vector of the previous f rame . 
obtained decoded vector storing, section 707 by the 
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following expression (15), and sends the changed input 
vector to the target extracting section 702, 

. S,(i) (1-q) • SJi)^ q(d(i) + S,.,(i) )/2 ...(15) 
In the above expression, q is a smoothing 
5 cdef f i.cient , which shows the degree of which the parameter 
vector of th*e current frame is updated close to armiddle 
point between the decoded vector of the previous -frame 
and the parameter vector of the ; future .frame . The coding, 
experiment shows that good performance can be obtained 
10/ when the upper limitation, of the number of repetition 
execiited/by the interior of the comparator 705 is 5 to: 
8 under the condition of .0. 2 < q <0.4. 

Although the above embodimeht uses the predictive 
vector quantization in the quantizing section 703, there 
15 is a high. possibility that the weighted coding distortion 
obtained by the distortion calculator 704 will become 
small. This is because, the quantized target is updated 

closer to the decoded vector of the previous frame by 
. ■ ■- ^ ■ ■ • ' ' 

smoothing. Therefore, by the repetition of decoding the- 

20 previous frame due to the control of the comparator 705, 
the possibility that the comparison result will become 
under the reference value is increased in the distortion 
comparison of the comparator 705. 

Also, in the decoder, there is prepared a decoding 

25. section corresponding to the quantizing section of the 
coder in advance such that decoding is carried out based 
on the index of the codevector transmitted through the 
t rans mis s ion path . 
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Also, the embodiment of the present invention was 
applied to. quantization (quantizing section is 
prediction VQ ) of LSP parameter- . appear ing CELP speech 
coder, and speech coding and decoding experiment was 
5 performed. As a result, it was confirmed that not only 
- the. sub j ective quality but also the ob j ect ive value ( S/N 
value) could be improved. This is because. there is. an 
effect in which the coding distortion of predictive VQ 
can be suppressed by coding repetition processing having 

10 vector smoothing even when the spectrum drastically 
changes.; Since the future prediction VQ; was predicted 
from the pas t - de-cpded. vector s , there was a- disadvantage 
in which the spectral disftprtion of the portion where the 
spectrum drastically changes • such as a speech onset 

15 Gontrarily increased. However, in the application of the 
embodiment of the pres ent inyen t ion , s ince smoothing is 
carried out until the distortion lessens in the case where 
the distortion is large, the coding distortion becomes 
small though the target is more or less separated from 

20' the actual parameter vector.. Whereby, there can- be ' 
obtained an effect in which degradation caused when 
. decoding the speech is totally reduced. Therefore, 
according to the embodiment, of the present invention, not 
only the sub j uctive. quality but also the objective value 

25 can be improved^ 

In the above-mentioned embodiment of the present 
invention, by the chairacteris tics of the comparator and 
the vector smoothing section, control can be provided to 



the direction where the operator does not perceptually 
feel the direction of degradation in the case where the 
. vector quantizing distortion is large , Also, in the case 
where predictive vector quantization is used in the 
5 quantizing section, smoothing and coding are repeated, 
' until : the coding distortion lessens., thereby the 
objective value can be also improved. 

The above' explained the case in which the"" present 
invention was applied to the' low- bit rate speech coding 
10 technique used in such as a cellular phone . However, the 
present invention can be employed in not only the speech 
coding but also the vector quantization for a parameter 
having a relatively good interpolation in a music coder 
and an image coder. 
15 . (Sixth embodiment) 

Next, the following will explain the CELP speech . 
coder according to the sixth embodiment. The 
configuration of this embodiment is the same as that of 
the fifth embodiment excepting quantization algorithm of 
20 the quantizing seiction using a multi-stage predictive 
vector quantization as a quantizing method. In other 
words, the excitation vector generator of the first 
embodiment is used as a random codebook. Here, the. 
quantization algorithm of the quantizing section will be 
25 specifically explained. 

FIG. 12 shows the functional block of the quantizing 
section. In the multi-stage predictive vector 

quantization, the vector quantization of the target is 
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carried put, thereafter the vector is decoded using a 
codebook with the ■ index of the quantized target, a 
difference between the coded vector.. Then/ the original 
target { her einafter . referred to as coded distortion 
5 vector) is obtained-, and the obtained coded distortion 
vector is further vector- quant ized. 

A vector codebook 89.9 in which a plurality of 
. dominant samples ( codevec tors ) of the predictive error 
vector are stored- and a codebook 900 ^are generated in 

10 advance . These codevectors are generated by applying the 
same algorithm as that of the codevector generating method 
of the typical ''multi- vector quantization". In other 
words, these codevectors are generally generated by an 
LEG algorithm (IEEE TRANSACTIONS ON. COMMUNICATIONS , VOL. 

15 COM-28/ NO. 1, PP84-95, JANUARY 1980) based on a large 
number of vectors obtained by analyzing many speech data . 
Note that, a training date for designing codevectors 899 
is a set p£ many target vectors while a training date 
for des igning codebook 900 is a set of coded distortion 

20. vectors obtained when the above^quantized targets are 
coded by the vector codebook 899. 

First, a vector 901 of the target vector is predicted 
by a predicting section 902. The prediction is carried 
out by the past-decoded vectors stored in a state storing 

25 section 903, and the obtained predictive error vector is 
sent to distance calculators 904 and 905. 

According to the above embodiment, as a form of 
prediction, a fixed coefficient is used for a first order 
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prediction.. Then, an expression for caicuiating the 
p"redictive error vector' in the case of using the above 
prediction is shown by the follo.w.ing expression (. 16) . 
Y(i) - X(i) - i3'D(i) ... (ley 

5 . where. Y(i) :. predictive' error vector, 

X( i ): target vector , 

j3 : predictive coefficient (scalar) / 
D(i').: decoded vector of one previous 
; . frame , and 
10 i: vector order. 

In the above expression, it is general that the 
predictive coefficient j3 is a value of 0 < i3 < 1 . 

Next, the distance calculator 904 calculates the 
distance between the predictive error. vector obtained by 
15 the prediction section 902 and code vector A stored. in 
the vector codeboo.k: 899. An expression for obtaining the 
above distance is shown by the following expres*sioh { 17 ) : 
■ - 2 • ■ . ■ ■. . ./ ■ 

where En: distance from n-th code vector. A 
20 Y(i): predictive error vector, . 

.Cln(i): codevector A, 

n: index of codervector A, 
I: vector order, and 
I: vector length. 
25 Then, in a searching section 906, the respective 

distances from the codevector A are compared, and' the 
index of the code vector A having the shortest distance 



is used as a code for code vector A. In other words,- the 
vector codebook 899 and the distance calculator 904 are 
controlled so as to ob.tain the code of codevector A having, 
the shortest-distance from all codevectors stored in the 
codebook 899. Then, the obtained code of codevector A 
is used as the index of'codeboojc 899. After this . the 
code for codevector A and decoded vector A obtained from 
the codebook-89^ with, reference to the code for codevector 
A are. sent to the distance calculator 905 , Also, the code , 
for codevector.A is sent to a searching section 906 through 
the transmission path.. 

The distance calculator 905 obtains a coded 
distortion vector from the predictive error, vector and 
the decoded vector A obtained from the searching section 
906 . Also; the distance calculator 905 obtains amplitude 
from an amplifier storing section 908 with reference to 
the code for codevector A obtained from the searching 
section 906. Then, the distance calculator 905 
calculates a 'distance by multiplying the above coded 
distortion vector and codevector B stored in the vector 
codebook 900 by the above amplitude,- and sends the; 
obtained distance to the searching section 907. An 
expression for the above distance is . shown as follows: 
Z(i) = Y(i) - ClN(i) 

Em ^ 

y{Z(iyaNC2mii)) (18). 

where Z(i): decoded vector, 

Y(i) : predictive error vector. 



ClN(i); decoded vector A," 

Ern : distance from m-th code vector B, 
aN : amplitude corresponding to the code for 
code vector A, . 
5 . C2m(i)' : codevector B, 

. m : index of codevector B, 

. i : vector order, and ' 

I : yectpr length > 
Then, in a searching section 90 7 , the respect ive 
10 distances from the codevector B. are compared, and the 
index of the codevector B having, the shortest distance 
,is used as a code for codevector . B . In other words, the 
cpdebook 900 and the distance calculator 905 are 
controlled so as to obtain the code . of codevector B having 
15 the shortest distance from all codevectors stored in the 
vector codebook 900. Then, the obtained code of 
codevector B is used as the index of codebook 900. After 
this, codevector. A and codevector B are added and used 
as a vector code 909/ 
20 Moreover the searching section 907 carries out the 

decoding of the vector using decoded vector s A, B obtained 
from the vector, codebooks 899 and 900 based on the codes 
for codevector A and codevector B , amplitude obtained from 
an amplifier storing section 908 and past decoded vectors 
25 stored in the state storing section 903. The content of 
the state storing section 903 is updated using the 
obtained decoded vector. (Therefore, the vector as 
decoded above is used in the prediction at a next coding 
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time). The decoding in the. prediction (a first 

prediction order and a fixed coefficient) in this embodiment 
is performed by the following expression . (.19 ) : 

•= " Z( i) = ClN(i) + aN-C2M(i) \+ f D{i) . ' ... (19) 

where Z(i) : decoded vector ( used as D( i) at the next 
codin~g time) / 
N: code for codevector A. 
M : code - for codevector B , ^ ■ 

ClN(i): decoded codevector A, 
C2M(i): decoded codevector B,. 

aN : amplitude corresponding to the code for 
codevector A, 
j3 : predictive coeff icient { scalar ) , 
p(i): decoded vector of one previous 
frame , and 
i : vector order , 
Also, although amplitude stored in the amplifier 
storing section 908 is pres.et, the setting method is set 
forth below. The amplitude is set by coding much speech 
data is coded, obtaining the sum of the coded distortions 
of the following expression (20), and performing the 
training such that the obtaine;d sum is minimized. 

; = y V(l^(0-CU^(0-'^^C2;ri,(0) ■•• (20) ' ' 

where EN: coded distortion when the code for 
. codevector A is N, 
N: code for codevector A, 

t: time when the code for codevector A is. 



56 



N , . 

Yt(I): predictive error vector at time t, 
ClN(i) : decode (i codeveetor A , . 

aN: amplitude corresponding to the code for 
codeveetor A, ' ; 

, C2m, ( i ) : codeveetor B , 

i: vector order, and 
I": .vector length. ■ ~ 

In other words after coding, amplitude is reset 
such that the value. which has been obtained by 
differentiating the distortion of the above expression-. 
(20) with respect to each amplitude, becomes zero, thereby 
performing the . training of amplitude. Then, by the 
repetition of coding and training, the suitable value of 
eac;h amplitude is obtained. 

On the other hand, the decoder performs the decoding- 
by obtaining the codeveetor based on the code of the vector 
transmitted. The decoder, comprises, the same vector 
codeboolcs (corresponding to codebpoks A. B) as those of 
the coder, the amplifier storing section, and the state 
storing section. Then, the decoder carries out the; 
decoding by the same algorithm as the decoding function 
" of. the searching section ( corresponding to the codeveetor 
B) in the aforementioned coding algorithm. 

Therefore, according to the above-mentioned 
embodiment, . by the characteristics of the amplifier 
storing section and the distance calculator, the code 
vector of the second stage is applied to that of the first 
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stage with a relatively small amount, of calculations, 
thereby the coded distortion can be reduced. 

The above explained the case in which the present 
invention was applied. to the low bit rate speed coding 
.5 . technique used in such as a cellular phone . However, the 
present invention can be employed in not only the speech 
coding, but also, the vector quantization for a parameter 
having a relatively good interpolation in. a mus-ic coder 
and an image coder. ■ 
10 . - (Seventh embodiment) 

Next, the. following will explain the GELP speech 
coder according to the sixth embodiment . This embodiment 
shows an example of a coder, which is capable of reducing 
the number of calculation steps for vector quantization 
15 processing for ACELP.type random codebook. 

FIG. 13 shows the functional block of the CELP speech 
coder according to this embodiment. In this CELP speech 
.coder, a filter coefficient analysis section 1002 
provides the linear predictive analysis to input speech 
20 signal 1001 so as to obtain coefficients of the synthesis 
filter, and outputs the obtained coefficients of the 
synthesis filter to a filter coefficient quantization 
section 1003. The filter coefficient quantization 
■ section 1003 quantizes the input coefficients of the 
25 synthesis filter and outputs the quantized coefficient s 
to a synthesis filter 1004. 

The synthesis filter 1004 is constituted by the 
filter coefficients supplied from the filter coefficient 



quantization section 1003. The synthesis. filter 1004 is 
excited by an excitation signal' 1011. 'The excitation 
signal 1011 is obtained by ^dding a- signal, which is 
obtained by mult ip lying an adaptive code vector 1006 ^ i :.e . 
5 ah output from an adaptive codebook 100 5, by an adaptive 
codebook gain 1007, and a signal/ which is obtained by 
multiplying a random codeve.Gtor 1009 , i.e." ; an output from 
a random.. codebook 1008 , by a. random codebook ga'ln loio .. 
Here,, the adaptive codebook 1005 is one that stores 
10 . a plurality of adaptive codevectors , which extracts the 
. past excitation signal. for exciting, the synthesis filter 
every pitch cycle. The random codebook . 100 7 is one that 
. stores a plurality of random codevectors. The random 
codebook 1007 can use the excitation vector generator of 
15 this aforementioned first embodiment . 

A distortion calculator 1013 . calculates a 
distortion between a synthetic speech signal 1012, i.e. / 

the output of the synthesis filter 1004' excited by* the 

. • ■ • ' •? ... 

excitation signal 1011, and the input speech signal 1001 
20 so' as to carry out code search processing. The code; 
search processing is one that specifies the index of^the 
adaptive codeveGtor 1006 for minimizing the distortion 
calculated by the distortion calculator 1013 and that of 
the random gain 1009. At the same, time, the code search 
25 processing is one. that calculates optimum values of the 
adaptive codebook gain 1007 and the random codebook gain 
1010 by which the respective output vectors are 
multipiied. 
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A code output section 1014 outputs the quantized 
value of the filter coefficients .obtainable from the 
filter coefficient quantization , section. 100 3 the index 
of the adaptive codevector 1006 selected by the distortion 
calculator 1013 and that of the random codevector 1009 , 
.and the quantized values of adaptive codebook gain 1007 
and random codebook gain' 1009 by which the respective 
output vectors-are multiplied. The outputs from the code 
output section 1014 -are transmitted or stored . . 

In the code search processing in the distortion-, 
calculator 1013, an. adaptive codebook component of the 
excitation signal is first searched, and a codebook 
component, of the excitation s ignal is next searched. 

The above search of the random codebook component 
uses an orthogonal search set forth below. 

The orthogonal search specifies a random vector c/ 
which maximizes a search * reference - value Eort ( = 
Nort/Dort) of expression ,f21). 

^\_^ \{{P^H^Hc)x-{x^Hp)Hp]HcJ ; ^^^^ 
"Don] {c'H-HcXp^H'Hp)HP'H'Hcy 

where Nort : numerator term for--Eort, 

Dort : denominator term for Eort , 

p.: adaptive codevector already specified, 

H: synthesis filter coefficient matrix, 

: transposed matrix for H, 

X: target signal (one that is obtained by 

differentiating a zero input response 

of the synthesis filter from the 



Eort 
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input speech signal), and *^ 
c: random codevector. 
The orthogonal search, is a search method for 
orthpgonalizing random cpdevector-s serving as candidates 
5 v/ith respect to. the adaptive vector specified in advance 
so as to specify index that minimizes the distortion from 
the plurality of or thogonalized random codevectors, . .The 
orthogonal searrch has the characteristics in "-which a 
accuracy, for the random codebook search can be improved 
10 as compared with a ndn^orthogonal search and the quality 
of the synthetic speech can be improved. 

In :the ACELP type speech coder, the random 
codevector is constituted by a few signed pulses. By use 
of the above characteristic, the numerator term (Nort) 
15 of the search reference value shown in expression (21) 
is deformed to the following expression (22) so as to 
■ reduce the number of calculation steps on the numerator 
term . 

Nort - ^]ilJ{lo)^a,Tp(l^ (22) 
20. where a^ : sign of i-th pulse (+1/-1), 

. 1^: 'p_osition of i-th pulse, 

N:. number of pulses, and 
(S> t { (p'=H'Hp)x- (x'^Hp).Hp}H. 
If the value of <f> of expression (22) is calculated 
25 in advance as "a .pre-processing and expanded to an array , 
(N-1) elements out of array cf) are added or substituted, 
and the resultant is squared, whereby the numerator term 
of expression (21) can be calculated. 



Next, the following will specifically explain the 
distortion calcula t or - 1 0 1 3 , v/hich is:cap,able of reducing 
the number of calculation steps on the denominator term. 
\ FIG. 14 shows the functional block of. the distortion 

• 5 calculator 101.3 . . The speech coder of this embodiment has 
the configuration' in which the adaptive codevector 1006 
and the random codevector- 1009 in the configuration of 
. FIG . 13 are input to the distortion calculator" 1013 . 

In FIG, 14, the following three processing is 
10 carried out as pre-processing at the time of calculating 
the distortion for each random codevector. 

. (1) Calculation of first matrix (N): power of 
synthesized adaptive . codevector (p^H.^Hp) and self- 
correlation matrix" of synthesis filter ' s coefficients 
15 . (H'^H) are computed, and each element of the self- 
correlation matrix are multiplied by the above power so 
as to calculate matrix N ( = { p'H'Hp ) H'H ) . 

(2) Calculate second matrix (M): time reverse 
synthesis is performed to the synthesized adaptive 

20 codevector. for producing (p'^H^'H) , and outer products of. 
the above resultant signal (p^H'^H) is calculated for. 
producing matrix M. 

(3) Generate third matrix (L) : matrix M calculated 
in item ( 2) is subtracted from matrix N calculated in item 

25. (1) so as to generate matrix L. 

Also , the denominator term (Dort) of expression (21) 
can be expanded as in the following expressions (23). 
Dort = (c'H'^Hc) (p'H'Hp) ^ (p^^H'Hc)' (23) 
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= c'Nc- (r'c)' - . ' 

= c'Ng - ( r'c ) ' ( r'c ) - 
= c'Nc- (c'rr'c ) . 
= -c'Nc- (c'Mc) 
5 = c'=(N-M)c . 

- c%c 

where N : ( p'^H'^Hp ) H^H the. above pre -p roc e s s ing ( 1 ) , 
' r: p^H'^H the above pre-procesS"ing (2) , 

M: rr^ the above pre-processing ( 2 ) , 

10 L: N-M the above pre-processing .( 3 ) , 

c: random codevector 
Thereby, the calculation of the denominator term 
, (Dprt) at the time of the calculation of the search 
reference value ( Eort ) of expression ( 2 1 ) is replaced with 
15 expression ( 23 ) , thereby making it possible to specify 
the random codebook component with the smaller amount of 
calculation. 

The. calculation of the denominator term is carried 
out' using the matrix L obtained in the above pre- 
20 processing and the random codevector 1009. 

Here, for simplifying the explanation, the 
- calculation method of the denominator term will be 
explained on the basis of expression (23) in a case where 
a sampling frequency of the. input speech signal is 8000 
25 Hz, the random codebook has Algebraic structure ,and its 
codevectors are constructed by five signed unit pulses 
per 10 ms frame. 

The. five signed unit pulses constituting the random 
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vector have pulses each selected from the candidate 
positions deFined for each of zero to fourth groups shown 
in Table 2 , then random vector c can be described by. the 
following expression (24).. 

C = a,5 (k-lo) + a,5 (k-lj + •■; +a,5(k-lj .• • (.24) 

(k^O, 1,. •••79) 

where a^ : s ign ( + 1 /.- D of pulse belonging to 
"group i , and , • . 

1^1 position of pulse belonging to group i. 
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Group 
Number 


Code 


Pulse Candidate Position 


0 


±1 


0. 10, 20, 30; 60, 70 


1 


±1 


2, 12, 22, 32, ■-, 62, 72 


2 


±1 


2. 16. 26, 36. 66, 76 


3 


±1 


4, 14. 24. 34, 64, 74 


' ^4 


±1 


8, 18, 28, 38, 68, 78 



At this time, the denominator terra (Dort) shown by 
' expression . (23) can be ^obtained by the . following 
15 expressioh (25): 
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Pore = 2^a,a,L{l,,lj) ••• (25) 



1-0 ;-0 

Where a^: sign .( + 1/ - 1 ) of -pulse belonging to. . 
group i , 

1^: position of pulse belonging to group i, and 
L(li., Ij): element (1,. row. and Ij column) of matrix L. 
, AS explained above, in the case where the ACELP type random 
codebookis used, the numerator term (Nort ) of the code search 
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reference value of expression (21) can be calculated by 
expression (22), while the denominator term (Dort) can 
■ be calculated by expression (25). Therefore, in the use 
of the ACELP type random codebook, the numerator term is- 
5 calculated by expression ( 22 ) and the denominator term 
is calculated by expression (25), respectively, instead 
of directly calculating of the reference value of expression (21) 1 
This makes it possible to greatly reduce the number of calculation 
steps for vector quantization processing of random excitations. 

10 , The aforementioned embodiments explained the random code 

' search with no pre -selection. However, the same effect as 
mentioned above can be obtained if the present invention is applied 
to a case in which pre-selection based on the values of expression 
(22) is employed, the values of expression (21) are calculated 

15 ' for only pre-^selected random codevectors with expression (22) and 
expression (25), then finally selecting one random codevector, 
which maximize the above search reference value. 



